HIGH SPEED DATA TRANSFER BETWEEN MAINFRAME 
STORAGE SYSTEMS 



5 The present application is a continuation of application Serial No. 

09/774,435, filed January 30, 2001 , the contents of which are incorporated 
herein by reference. 

BACKGROUND OF THE INVENTION 
10 This invention relates generally to computer systems, and more 

particularly to methods and associated systems for transferring data between 
storage systems. 

DESCRIPTION OF THE BACKGROUND ART 

15 For back-up purposes, data stored in a disk unit of a local mainframe 

computer system are copied to a remote storage device to prevent data loss 
in the event of a disaster such as a disk crash or facility shutdown. U.S. 
Patent No. 6,098,129 to Fukuzawa et al. ('FukuzawaQ discloses a configuration 
for backing-up data from a mainframe computer system Cmainframd') to an 

20 open computer system. Although Fukuzawa discloses the use of low-cost 
open computer system storage devices for backing-up mainframe data, 
Fukuzawa does not disclose the use of another mainframe storage device for 
back-up. 

Because mainframes are generally more reliable than other types of 
25 computer systems, data stored in the disk unit of a mainframe are ideally 
backed-up to a disk unit of another mainframe. Remote dual copy functions, 
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which involve the backing-up of stored data from one computer system to 
another in real-time, have been performed between mainframes using the so- 
called Count-Key-Data ('CKD') protocol. The CKD protocol allows currently 
available mainframes to transfer data at a rate of approximately 17 MB/s 
5 (mega-bytes/second). To increase the amount of data that can be copied from 
one mainframe to another within a period of time, it is desirable to obtain 
a data transfer rate that is faster than what is currently obtainable using the 
CKD protocol. 

10 SUMMARY OF THE INVENTION 

The present invention relates to a method and associated systems for 
transferring data between mainframe storage devices. While the invention is 
suitable for remote dual copy functions, the invention may be generally used 
in applications requiring data transfers. 

15 In one embodiment of the invention, a local disk system of a local 

mainframe includes one or more local disk units. For back-up purposes, data 
in at least one of the local disk units are copied to a designated remote disk 
unit of a remote disk system. Data transfer between the disk units of the local 
and remote disk systems occurs over a fixed block infrastructure to increase 

20 data transfer rates. Accordingly, variable-length data received in the local disk 
system and destined to be backed-up to the remote disk system are first 
converted to fixed-length data prior to transmission over the fixed block 
infrastructure. In the remote disk system, fixed-length data received over the 
fixed block infrastructure are converted back to variable-length data. 

25 In one embodiment of the invention, a method for performing data 
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transfer between a local disk system and a remote disk system includes the 
steps of receiving variable-length data in the local disk system, converting the 
variable-length data to fixed-length data, sending the fixed-length data to the 
remote disk system, and converting the fixed-length data back to variable- 
5 length data in the remote disk system. The use of fixed-length data in the just 
mentioned method increases the data transfer rate between the local and the 
remote disk systems. 

These and other features and advantages of the present invention will 
be readily apparent to persons of ordinary skill in the art upon reading the 
10 entirety of this disclosure, which includes the accompanying drawings and 
claims. 



BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 shows a schematic diagram of a configuration for performing a 
15 remote dual copy function in an embodiment of the present invention. 

FIG. 2 illustrates the format of a track in Count-Key-Data (CKD) format. 
FIG. 3 illustrates the conversion of variable-length data to fixed-length 
data and vice versa in an embodiment of the present invention. 

FIG. 4 shows a schematic diagram of a configuration for performing a 
20 remote dual copy function in another embodiment of the present invention. 
FIG. 5 illustrates the structure of a copy pair information in an 
embodiment of the present invention. 

FIG. 6 illustrates the structure of a segment control block in an 
embodiment of the present invention. 
25 FIGS. 7A and 7B show a method for performing a remote dual copy 



3 



*> 

11 

t 

function in an embodiment of the present invention. 

FIG. 8A shows a schematic diagram of a configuration for performing a 
remote dual copy function in another embodiment of the present invention. 

FIG. 8B illustrates the structure of a segment control block in another 
5 embodiment of the present invention. 

FIGS. 9 and 10 show schematic diagrams of configurations for 
performing a remote dual copy function in other embodiments of the present 
invention. 

The use of the same reference number in different drawings indicates 
10 the same or like components. 



DETAILED DESCRIPTION OF THE INVENTION 

Turning now to FIG. 1 , there is shown a schematic diagram of a 
configuration for performing a remote dual copy function in accordance with 

15 an embodiment of the present invention. In a local mainframe 10A, data 

provided by a host system 1 1 A are stored in a disk unit 14A of a disk system 
13A. Host system 1 1A, which is the central processing unit of local mainframe 
10A, conventionally reads from and writes to disk unit 14A using variable- 
length data commonly referred to as a "record'. The well known 

20 Count-Key-Data (CKD') protocol provides a format for representing variable- 
length data in a mainframe. In this embodiment, host system 1 1 A provides 
variable-length data to disk system 13A via a CKD channel 12A. 

In general, the control software overhead of protocols using variable- 
length data is higher than that of protocols using fixed-length data (also 

25 referred to as 'fixed-length blocks or'fixed blocks). Thus, variable-length data 



4 



protocols such as CKD are generally slower than fixed-length data protocols 
such as the Small Computer Systems Interface (SCSI'). As a comparison, the 
data transfer rate of SCSI is 1 00 MB/s while that of CKD is only 1 7 MB/S. In 
the present invention, a fixed block channel 18 (e.g., SCSI channel) is 
5 employed to increase the data transfer rate between disk system 13A and 
disk system 13B. As shown in FIG. 1, disk system 13A includes a conversion 
function 15A for converting the variable-length data received from host system 
1 1 A to fixed-length data, which are then transported over channel 18 via a 
fixed block interface 17A (e.g., SCSI interface). In remote mainframe 10B, a 

10 fixed block interface 1 7B receives the fixed-length data from fixed block 

interface 17A. The fixed-length data are provided to disk system 13B, which 
includes a disk unit 14B for storage and a conversion function 15B for 
converting the fixed-length data back to variable-length data (and vice versa). 
As is well known, a record stored in a disk unit of a mainframe is 

15 located by specifying a cylinder number, a head number, a sector number, 
and a record number. The cylinder number identifies a magnetic disk in the 
disk unit while the head number identifies a read/write head. The cylinder 
number and the head number, together, identify a track, which is a circular 
region on the magnetic disk where individual records are stored. Each track is 

20 further divided into fixed-angled regions commonly known as sectors. A sector 
provides the general location of a record on a track, and thus facilitates the 
searching of a record. 

FIG. 2 shows the format of a track 51 in CKD format. Track 51 includes 
a Home Address ('HA') 200, gaps 204, and records R0, R1, R2, etc. HA 200 is 

25 located at the beginning of track 51 and contains control information for 
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accessing and identifying the track. As shown in FIG. 2, each field in track 51 
is separated by a gap 204. HA 200 and gaps 204 have fixed lengths. Each 
record further includes a count field 201 (i.e., 201 A, 201 B...), a key field 202 
(i.e., 202B,..), and a data field 203 (i.e., 203A, 203B,..). Count field 201 has a 
5 fixed length and contains record control information such as the record 

number, the length of key field 202, and the length of data field 203. Key field 
202 includes key information for accessing the user or system data stored in 
data field 203. When count field 201 indicates that the length of key field 202 
is zero, the record does not include a key field. To locate a record, the record 

10 number indicated in count field 201 is checked because the record numbers 
are not necessarily consecutive. That is, record R1 does not necessarily 
follow record R0, record R2 does not necessarily follow record R1, and so on. 

The conversion of variable-length data to fixed-length data, and vice 
versa, in accordance with an embodiment of the present invention is now 

15 described with reference to FIG. 3. As shown in FIG. 3, the contents of a track 
can be stored in a predetermined number of fixed-length blocks (i.e., fixed 
blocks) because the length of a track is fixed. Furthermore, the blocks that are 
in a sector 205 (i.e., 205A, 205B,...) are readily identified because the length 
of a sector is also fixed. That is, the fixed blocks for a particular sector can be 

20 found knowing the position of the sector relative to HA 200, the number of 

blocks per sector, and the number of sectors per track. In the example of FIG. 
3, the contents of track 51 are stored in fixed blocks 300A, 300B, 300C, 
etc. Fixed block 300A is referred to as the 'top block' and includes the contents 
of HA 200. Thus, the track represented by a set of fixed blocks 300 can be 

25 identified by looking up the track number indicated in the HA 200 stored in a 
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fixed block 300A. The fixed blocks following fixed block 300A are 
consecutively arranged to facilitate the conversion of the fixed blocks back 
into CKD format. That is, fixed block 300B follows fixed block 300A, fixed 
block 300C follows fixed block 300B, and so on. Thus, fixed blocks 300B, 
5 300C, 300D, etc. can be consecutively arranged to recreate the CKD 
formatted data once the matching fixed block 300A is found. 

As can be appreciated by persons of ordinary skill in the art reading the 
present disclosure, fixed blocks 300 are suitable for transportation using a 
fixed block protocol such as SCSI. For example, each fixed block 300 can be 

10 assigned a unique SCSI logical block address (LBA) because the number of 
fixed blocks in a track and the number of tracks in a disk unit are fixed. Thus, 
assuming that each track has 100 fixed blocks, an LBA of 3521 may be used 
to identify the 22nd block in the 35th track. 

FIG. 4 shows a schematic diagram of a configuration 150 for 

15 performing a remote dual copy function in another embodiment of the present 
invention. As shown in FIG. 4, a local host system 102, which is the central 
processing unit of a local mainframe 100, provides CKD formatted data to a 
local disk system 104 via a CKD interface 1 19A. Local disk system 104 further 
includes disk units 112A (i.e., 112A-1, 112A-2,...) where data are stored, and 

20 a local disk control unit 106 for controlling disk units 1 12A. 

Local disk control unit 106 includes a cache memory 1 13A where data 
that are in transit or frequently accessed are temporarily stored before being 
written to a disk unit 1 12A. Data in cache memory 1 13A are organized in 
segments (i.e., segments 116A-1, 116A-2,...), with each segment having 

25 enough space to hold the entire contents of a single track. 
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Local disk control unit 106 also includes a mainframe read/write 
process 108A for processing disk read and write commands received from 
local host system 102, a data send process 109 for sending data to remote 
mainframe 101 , and a disk unit read/write process 1 1 1 A for transferring data 
5 between disk units 1 12A and cache memory 1 13A. In this disclosure, the term 
'|Droces3' includes hardware, software, and/or firmware for performing the 
indicated function. All of the just mentioned processes can access a shared 
memory 114A, which contains multiple copy pair information 117A (i.e., 117A- 
1, 117A-2,...)and segment control blocks118A (i.e., 118A-1, 1 18A-2,...). A 

10 CKD/FBA conversion function 1 15A, which is generally available to all 

processes of local disk control unit 106, is called by read/write process 108A 
to convert CKD formatted data to fixed blocks and vice versa. In one 
embodiment, CKD/FBA conversion function 1 15A employs the technique 
described in connection with FIG. 3. 

15 A copy pair information 1 17A identifies a disk unit in remote mainframe 

101 that is designated as a back-up of a disk unit in local mainframe 100. FIG. 
5 shows the structure of a copy pair information 1 17A. Referring to FIG. 5, a 
local storage system address 400 specifies a local disk system in local 
mainframe 100 (e.g., local disk system 104). A disk unit address 401 specifies 

20 a disk unit in the local disk system. Similarly, a remote storage system 
address 402 and a disk unit address 403 specify a remote disk system in 
remote mainframe 101 (e.g., remote disk system 105) and a disk unit in the 
remote disk system, respectively. The contents of the disk unit specified in 
disk unit address 401 are copied to the disk unit specified in disk unit address 

25 403 during a remote dual copy function. 
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In configuration 150 shown in FIG. 4, each segment control block 118A 
contains information relating to a corresponding segment 116A stored in 
cache memory 1 13A. FIG. 6 shows the structure of a segment control block 
118A in configuration 150. A disk unit address 500 specifies a disk unit 112A 
5 where storage space is allocated for the segment 1 16A. As mentioned, a 
segment 1 16A has enough space to hold the entire contents of the allocated 
track. If cache memory 1 13A is organized in terms of fixed blocks, as is the 
case in configuration 150, a segment 1 16A has enough space to hold all the 
fixed blocks of a track. A top block address 501 indicates the top block 

10 address of the track for which a segment 1 16A is allocated, and can thus be 
used to locate the segment 1 16A. 

As shown in FIG. 6, a segment control block 1 18A also includes a 
block bitmap 502, a remote write bitmap 503, and a local write bitmap 504. 
Each bit of bitmaps 502, 503, and 504 corresponds to a block of the segment 

15 1 16A identified by top block address 501. Accordingly, the number of bits of 
each of the just mentioned bitmaps is equal to the number of blocks in a 
segment 1 16A. Each bit of block bitmap 502 indicates whether the 
corresponding block is in cache memory 1 13A; i.e., when a bit of block bitmap 
502 is ON, the block which corresponds to that bit is in a segment 1 16A in 

20 cache memory 1 13A. The bits of remote write bitmap 503 indicate whether 
the corresponding blocks need to be written to a disk unit in remote disk 
system 105. That is, when a bit of remote write bitmap 503 is ON, the block 
which corresponds to that bit is to be transmitted to remote disk system 105 of 
remote mainframe 101. Similarly, each bit of local write bitmap 504 indicates 

25 whether the corresponding block needs to be written to a disk unit of local disk 
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system 104. 

Referring to FIG. 4, a fixed block interface 120A transports fixed blocks 
to remote mainframe 101 over a fixed block infrastructure 121. In one 
embodiment, interface 120A is a SCSI interface, and infrastructure 121 is a 
5 SCSI infrastructure that includes SCSI cables, line drivers, adapters, 

repeaters, etc. To process the fixed blocks received over infrastructure 121 , 
remote disk system 105 includes components that mirror those of local disk 
system 104. That is, remote disk system 105 also has a data receive process, 
a mainframe read/write process, a CKD/FBA conversion function, a cache 

10 memory, a shared memory, a disk unit read write process, and a CKD 

interface that are similar to those in remote disk system 105. In the present 
disclosure (including in FIG. 4), the same or like components are labeled with 
the same reference numeral. For example, shared memory 11 4A of local disk 
system 104 is similar to shared memory 1 14B of remote disk system 105. 

15 A method for performing a remote dual copy function in accordance 

with an embodiment of the present invention is now described with reference 
to FIG. 7A, FIG. 7B, and FIG. 4. Referring to FIG. 7A, a remote dual copy 
function is initiated when read/write process 108A receives a Define Extent 
command from local host system 102 (step 701). As is conventional, the 

20 Define Extent command includes information for processing forthcoming 
Locate and Read/Write commands such as cache memory utilization mode 
etc. After receiving the Define Extent command, read/write process 108A then 
receives a Locate Command (step 702). As is conventional, the Locate 
command specifies a record to access by providing a cylinder number, a head 

25 number, a sector number, and a record number. The cylinder number and the 
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head number, together, identify a particular track in a disk unit. To determine if 
there is a segment 1 16A allocated for the track specified in the Locate 
command, read/write process 108A checks the top block addresses 501 of 
the segment control blocks 1 18A (step 703). Note that read/write process 
5 1 08A may utilize CKD/FBA conversion function 1 1 5A to convert CKD 
formatted data to fixed blocks and vice versa. 

If a segment 1 16A is allocated for the track, read/write process 108A 
checks the block bitmap 502 of the corresponding segment control block 
1 18A to determine if fixed blocks belonging to the sector specified in the 

10 Locate command are in cache memory 1 13A (step 704). 

If the blocks corresponding to the sector number are not in cache 
memory 1 13A or if a segment 1 16A is not allocated for the track specified in 
the Locate Command, a segment 1 16A and corresponding segment control 
block 1 18A are created for the track (step 706). Thereafter, the contents of the 

15 track are loaded from the disk unit 1 12A specified in the Locate command to 
disk unit read/write process 1 1 1 A (step 707), converted to fixed blocks (step 
708), and then stored in cache memory 1 13A in the allocated segment 1 16A 
(step 709). 

Once it is established that the contents of the track are in cache 
20 memory 1 1 3A, read/write process 1 08A finds a record in a disk unit 1 1 2A 
where write data from a forthcoming Write command is to be written (step 
705). Subsequently, read/write process 108A receives the Write command 
that goes with the previously received Define Extent and Locate commands 
(step 710). Read/write process 108 converts the write data that accompany 
25 the write command from CKD format to fixed blocks (step 71 1 ), stores the 
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converted write data to cache memory (step 712), and then sets the 
corresponding bits in remote write bitmap 503 and local write bitmap 504 
(step 713). At a later time, disk unit read/write process 1 1 1 A conventionally 
writes the fixed blocks identified in local write bitmap 504 to their respective 
5 disk units 11 2A. 

Continuing with step 714 shown in FIG. 7B, data send process 109 
checks the bits of the remote write bitmaps 503 in shared memory 1 14A to 
find the fixed blocks that need to be sent to remote disk system 105. Data 
send process 109 uses the information in a copy pair information 1 17A to 

10 determine the remote disk unit designated to receive the fixed blocks (step 
715). Data send process 109 sends the fixed blocks to remote disk system 
105 via fixed block interface 120A and over fixed block infrastructure 121 
(step 716). Because fixed block interface 120A, fixed block interface 120B, 
and infrastructure 121 are based on SCSI in this embodiment, each fixed 

15 block is assigned a unique logical block address. 

In remote disk system 105, a data receive process 110 receives the 
fixed blocks via a fixed block interface 120B (step 717). Data receive process 
1 10 then checks the top block addresses of segment control blocks 1 18B to 
determine if there is a segment 1 16B allocated for each received fixed block 

20 (step 71 8). If a segment 1 1 6B is not allocated, a segment 1 1 6B and a 

corresponding segment control block 118B are created for the fixed block 
(step 719). Data receive process 110 then stores the fixed blocks in their 
respective segments 1 1 6B (step 720). Thereafter, data receive process 
110 sets the corresponding bits in the block bitmap and local write bitmap of 

25 the segment control block 1 18B (step 721 ), and notifies data send process 
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109 that the fixed blocks have been received and processed in remote disk 
system 105 (step 723). In response, data send process 109 resets the 
corresponding bits in the remote write bitmap 503 in local disk system 104. At 
a later time, disk unit read/write process 1 1 1B in remote disk system 105 
5 conventionally writes the fixed blocks identified in the local write bitmap of the 
segment control block 1 18B to their respective disk units 1 12B. 

FIG. 8A shows a schematic diagram of a configuration 250 for 
performing a remote dual copy function in another embodiment. In contrast to 
configuration 150, cache memory 213 (i.e., 21 3A, 21 3B), segment 216 (i.e., 

10 216A, 216B), and segment control block 218 (i.e., 218A, 218B)of the 

mainframes in configuration 250 are configured to process CKD formatted 
data. That is, each segment 216 of a cache memory 213 has enough space to 
hold the records of a single track in CKD format. 

In configuration 250, each segment 216 has a corresponding segment 

15 control block 218. FIG. 8B shows the structure of a segment control block 218 
in configuration 250. As shown in FIG. 8B, a disk unit address 800 specifies a 
disk unit 1 12 where storage space is allocated for the segment 216. A track 
address 801 contains the address of the track allocated for the segment 216. 
A segment control block 218 further includes a record bitmap 802, a 

20 remote write record bitmap 803, and a local write record bitmap 804. Each bit 
of the just mentioned bitmaps corresponds to a record stored in the 
corresponding segment 216. Accordingly, the number of bits of each of the 
just mentioned bitmaps is equal to the maximum number of records in a track. 
Record bitmap 802 indicates whether a record is in a segment 216. 

25 When a bit of record bitmap 802 is ON, the record that corresponds to that bit 
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is in a corresponding segment 216. 

Remote write record bitmap 803 indicates whether a record in the 
corresponding segment 216 needs to be written to a disk unit in the remote 
disk system (which is identified in a copy pair information 117 similar to that 
5 used in configuration 150). When a bit in remote write record bitmap 803 is 
on, the record that corresponds to that bit is transmitted to the remote disk 
system. 

Local write record bitmap 804 indicates whether a record in the 
corresponding segment 216 needs to be written to a disk unit in the local disk 

10 system. When a bit of local write record bitmap 804 is on, the record that 
corresponds to that bit is written to a disk unit in the local disk system. 

In configuration 250, CKD formatted data from local host system 102 
are not converted to fixed blocks until the data are ready to be transmitted to 
remote disk system 105. Accordingly, data send process 109 calls CKD/FBA 

15 conversion function 1 15A to convert the CKD formatted data to fixed blocks 
before handing the data to fixed block interface 120A. In remote disk system 
105, data receive process 110 calls CKD/FBA conversion function 1 15B to 
convert the fixed blocks received over fixed block infrastructure 121 back to 
CKD format. 

20 As is evident from the foregoing, configuration 250 and configuration 

150 are similar except for the use of CKD formatted data in the cache memory 
of configuration 250. Persons of ordinary skill in the art will appreciate that the 
present invention can be employed regardless of the cache memory 
management scheme. For example, FIG. 9 shows a configuration 350 where 

25 the local disk system uses a fixed block cache management scheme similar to 
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that used in the local disk system of configuration 150, whereas the remote 
disk system uses a CKD cache management scheme similar to that used in 
the remote disk system of configuration 250. Similarly, FIG. 10 shows a 
configuration 450 where the local disk system uses a CKD cache 
5 management scheme similar to that used in the local disk system of 

configuration 250, whereas the remote disk system uses a fixed block cache 
management scheme similar to that used in the remote disk system of 
configuration 150. 

A method and associated systems for transferring data between 

10 storage systems for mainframe computers have been disclosed. While 

specific embodiments have been provided, it is to be understood that these 
embodiments are for illustration purposes and not limiting. Many additional 
embodiments will be apparent to persons of ordinary skill in the art reading 
this disclosure. For example, while the invention is suitable for use in remote 

15 dual copy functions, the invention is not so limited and may be generally used 
in applications requiring data transfer between storage systems. Thus, the 
present invention is limited only by the following claims. 
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